This listing of claims will replace all prior versions and listings of claims in the 
instant application: 

LISTING OF CLAIMS 

1. (Currently Amended) A method of [[improving speech recognition]] automatically 
training a word database used by a speech recognition engine to speech-to-text 
convert utterances comprising: 

taking a realization of spoken audio and a first representation that is an allegedly 
true textual representation for said realization; 

performing a speech recognition on said realization using the word database 
thereby producing a second representation that is a textual representation for said 
realization; 

aligning said first representation and said second representation on a word-by- 
word basis; 

detecting at least one word aligned in the aligning step that is different in the first 
representation and the second representation; and 

[[selecting single words from said first representation and corresponding aligned 
single words from said second representation and pairing said aligned single words; 
wherein said first and said second representations are different; and]] 

automatically updating [[a]] the word database to include the at least one detected 
word [[using said selected paired words]] together with said corresponding speech segment 
extracted from the aligned realization, wherein the updating step extends the word 
database. 
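
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the applicant's implementation. The function names, the dictionary used as a word database, and the per-word time spans in `segments` are all hypothetical, and Python's standard `difflib.SequenceMatcher` stands in for whatever word-by-word aligner an actual system would use.

```python
from difflib import SequenceMatcher


def detect_differing_words(first_words, second_words):
    """Align two word lists and return (index, first_word, second_word) for
    every position where one word was substituted for another."""
    matcher = SequenceMatcher(a=first_words, b=second_words, autojunk=False)
    differing = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        # Keep only equal-length "replace" spans so each pair is a single word.
        if tag == "replace" and (i2 - i1) == (j2 - j1):
            for offset in range(i2 - i1):
                differing.append(
                    (i1 + offset, first_words[i1 + offset], second_words[j1 + offset])
                )
    return differing


def extend_word_database(word_db, differing, segments):
    """Store each detected word with its speech segment (assumed time spans)."""
    for index, first_word, _second_word in differing:
        word_db.setdefault(first_word, []).append(segments[index])
    return word_db


# First representation: the allegedly true text. Second: recognizer output.
first = "please call doctor nguyen tomorrow".split()
second = "please call doctor win tomorrow".split()
# Hypothetical (start, end) times in seconds for each word of the first text.
segments = [(0.0, 0.4), (0.4, 0.7), (0.7, 1.2), (1.2, 1.7), (1.7, 2.3)]

differing = detect_differing_words(first, second)
word_db = extend_word_database({}, differing, segments)
```

In this sketch only the misrecognized word is added, so the database is extended rather than overwritten.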


{WP215417;3} 
2. (Currently Amended) The method according to claim 1, wherein said 
[[selecting]] detecting step uses speech recognition information derived from said speech 
recognition. 

3. (Original) The method according to claim 2, wherein said aligning step reveals 
time information corresponding to the alignment between said realization and said first 
representation. 

4. (Currently Amended) The method according to claim [[2]] 1, said updating 
step further comprising: 

comparing the recognition quality of said speech recognition of said realization 
with the recognition quality of a corresponding single word entry existing in said word 
database. 
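
The comparison recited in claim 4 can be sketched as follows. The claim states only that the two recognition qualities are compared. The numeric `quality` score, the entry layout, and the rule of keeping the higher-scoring realization are assumptions made for illustration.

```python
def update_if_better(word_db, word, segment, quality):
    """Compare the new realization's quality with the existing entry's quality
    and store the new one only if there is no entry or the new one scores higher."""
    existing = word_db.get(word)
    if existing is None or quality > existing["quality"]:
        word_db[word] = {"segment": segment, "quality": quality}
        return True
    return False


quality_db = {}
added = update_if_better(quality_db, "nguyen", (1.2, 1.7), 0.6)     # no entry yet
rejected = update_if_better(quality_db, "nguyen", (3.0, 3.4), 0.4)  # lower quality
improved = update_if_better(quality_db, "nguyen", (5.1, 5.6), 0.9)  # higher quality
```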

5. (Original) The method according to claim 4, wherein said first and said second 
representations are comprised of segments, said comparing step further comprising: 
tagging said segments of said first and said second representations where both said first 
and said second representations consist of a single word. 

6. (Currently Amended) A method of improving speech recognition 
comprising: 

taking a realization of spoken audio and a first representation that is an allegedly 
true textual representation for said realization; 

performing a speech recognition on said realization using a word database thereby 
producing a second representation that is a textual representation for said realization; 

aligning said first representation and said second representation; 

detecting matching word pairs resulting from said aligning step; and 

[[selecting single words from said first representation and corresponding aligned 
single words from said second representation and pairing said aligned single words, 
wherein said first and said second representations are identical; and]] 

automatically updating [[a pronunciation database]] the word database using said 
[[selected paired words]] matching word pairs together with [[said]] corresponding 
[[aligned]] realization information, wherein the updating step extends the word database. 
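
Claim 6 differs from claim 1 in that it uses the words on which the two representations agree. A minimal sketch, under the same assumptions as any illustration of these claims (hypothetical function names, a dictionary as the word database, assumed per-word time spans, and `difflib.SequenceMatcher` as a stand-in aligner), might look like this:

```python
from difflib import SequenceMatcher


def matching_word_pairs(first_words, second_words):
    """Return (index, word) for every word on which both representations agree."""
    matcher = SequenceMatcher(a=first_words, b=second_words, autojunk=False)
    pairs = []
    for block in matcher.get_matching_blocks():
        # The final block has size 0 and contributes nothing.
        for offset in range(block.size):
            pairs.append((block.a + offset, first_words[block.a + offset]))
    return pairs


def add_realizations(word_db, pairs, segments):
    """Attach the realization information of each matched word to its entry."""
    for index, word in pairs:
        word_db.setdefault(word, []).append(segments[index])
    return word_db


first = "please call doctor nguyen tomorrow".split()
second = "please call doctor win tomorrow".split()
# Hypothetical (start, end) times in seconds for each word of the first text.
segments = [(0.0, 0.4), (0.4, 0.7), (0.7, 1.2), (1.2, 1.7), (1.7, 2.3)]

pairs = matching_word_pairs(first, second)
# "call" already has one stored realization; the new one is added alongside it.
word_db = add_realizations({"call": [(5.0, 5.3)]}, pairs, segments)
```

Here each correctly recognized word gains an additional realization, which extends the database with new pronunciation examples rather than new words.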

7. (Currently Amended) The method according to claim 6, wherein said 
[[selecting]] detecting step uses speech recognition information derived from said speech 
recognition. 

8. (Original) The method according to claim 7, wherein said aligning step reveals 
time information corresponding to the alignment between said realization and said first 
representation. 

9. (Currently Amended) The method according to claim [[7]] 6, said updating 
step further comprising: 

comparing the recognition quality of said speech recognition of said realization 
with the recognition quality of a corresponding single word entry existing in said 
pronunciation database. 

10. (Original) The method according to claim 9, wherein said first and said second 
representations are comprised of segments, said comparing step further comprising: 

tagging said segments of said first and said second representations where both said first 
and said second representations consist of a single word. 

11. (Currently Amended) A system for improving speech recognition of a speech 
recognizer, said system comprising: 

a word database used by the speech recognizer to perform speech recognition 
tasks; 

an aligner configured to align a first representation and a second representation 
produced by said speech recognizer; 

a classifier configured to compare said aligned first representation with said 
aligned second representation; and 

a selector configured to select corresponding single word pairs from said aligned 
first representation and said aligned second representation, wherein at least one word of 
the single word pairs selected by the selector are combined with a realization of spoken 
audio from which said speech recognizer produced the second representation, the 
combination being used to update the word database, wherein the updating step extends 
the word database. 

12. (Currently Amended) The system according to claim 11, wherein said first 
representation and said second representation are different, and wherein updating the 
word database results in new words being added to the word database. 

13. (Original) The system according to claim 11, wherein said first representation 
and said second representation are identical. 


14. (Currently Amended) The system according to claim 11, further comprising 
means for updating a word database or a pronunciation database based upon the 
combination [[said single word pairs selected by said selector]]. 

15. (Original) The system according to claim 11, said aligner further comprising: 
means for generating time information corresponding to time alignment between 
said first representation and said second representation. 

16. (Original) The system according to claim 15, wherein said first and said second 
representations comprise segments, said classifier further comprising: 

means for tagging said segments of said first representation and said second 
representation where said first representation and said second representation consist of a 
single word. 

17. (Currently Amended) A machine-readable storage, having stored thereon a 
computer program having a plurality of code sections executable by a machine for 
causing the machine to perform the steps of: 

taking a realization of spoken audio and a first representation that is an allegedly 
true textual representation for said realization; 

performing a speech recognition on said realization using the word database 
thereby producing a second representation that is a textual representation for said 
realization; 

aligning said first representation and said second representation on a word-by- 
word basis; 

detecting at least one word aligned in the aligning step that is different in the first 
representation and the second representation; and 

[[selecting single words from said first representation and corresponding aligned 
single words from said second representation and pairing said aligned single words, 
wherein said first and said second representations are different; and]] 

automatically updating [[a]] the word database to include the at least one detected 
word [[using said selected paired words]] together with said corresponding speech segment 
extracted from the aligned realization, wherein the updating step extends the word 
database. 

18. (Currently Amended) The machine-readable storage according to claim 17, 
wherein said [[selecting]] detecting step uses speech recognition information derived from 
said speech recognition. 

19. (Original) The machine-readable storage according to claim 18, wherein said 
aligning step reveals time information corresponding to the alignment between said 
realization and said first representation. 

20. (Currently Amended) The machine-readable storage according to claim 
[[18]] 17, said updating step further comprising: 

comparing the recognition quality of said speech recognition of said realization 
with the recognition quality of a corresponding single word entry existing in said word 
database. 

21. (Original) The machine-readable storage according to claim 20, wherein said 
first and said second representations are comprised of segments, said comparing step 
further comprising: 

tagging said segments of said first and said second representations where both said 
first and said second representations consist of a single word. 

22. (Currently Amended) A machine-readable storage, having stored thereon a 
computer program having a plurality of code sections executable by a machine for 
causing the machine to perform the steps of: 

taking a realization of spoken audio and a first representation that is an allegedly 
true textual representation for said realization; 

performing a speech recognition on said realization using a word database thereby 
producing a second representation that is a textual representation for said realization; 

aligning said first representation and said second representation; 

detecting matching word pairs resulting from said aligning step; and 

[[selecting single words from said first representation and corresponding aligned 
single words from said second representation and pairing said aligned single words, 
wherein said first and said second representation are identical; and]] 

automatically updating [[a pronunciation database]] the word database using said 
[[selected paired words]] matching word pairs together with [[said]] corresponding aligned 
realization information, wherein the updating step extends the word database. 

23. (Currently Amended) The machine-readable storage according to claim 22, 
wherein said [[selecting]] detecting step uses speech recognition information derived from 
said speech recognition. 

24. (Original) The machine-readable storage according to claim 23, wherein said 
aligning step reveals time information corresponding to the alignment between said 
realization and said first representation. 

25. (Currently Amended) The machine-readable storage according to claim 
[[23]] 22, said updating step further comprising: 

comparing the recognition quality of said speech recognition of said realization 
with the recognition quality of a corresponding single word entry existing in said 
pronunciation database. 

26. (Original) The machine-readable storage according to claim 25, wherein said 
first and said second representations are comprised of segments, said comparing step 
further comprising: 

tagging said segments of said first and said second representations where both said 
first and said second representations consist of a single word. 

27. (New) The method of claim 1, further comprising the step of: 
obtaining said first representation using an optical character recognition 
technology. 

28. (New) The method of claim 1, wherein the word database is a speaker 
dependent database used to adapt the speech recognition to a particular speaker. 